ABSTRACT


A computerized model for estimating the economic effects of the spread of
the gypsy moth throughout the United States has been developed. The model,
GYMMSIM1, uses stochastic simulation in a financial budgeting framework
over a planning horizon and has been developed to allow a wide range of
assumptions and policy alternatives to be simulated. This manuscript 
provides general model documentation describing the use of program parameters 
for various scenarios, definitions of variables, a description of the SAS 
data base required for the simulation, and SAS macros used. Technical 
appendices provide information for those who wish to modify, revise, or 
improve the program. 


Key words: Gypsy moth, stochastic simulation, financial budgeting,
planning horizon, present value, documentation.


INTRODUCTION


The gypsy moth (GM) is a pest which has received increasing attention in 
the past few years. From its origin in Massachusetts in 1869, it has now 
spread throughout New England and the Middle Atlantic States. It is a 
nuisance to residential households, and governmental officials expect an
increasingly strong public outcry for control or eradication of the pest
as it spreads to more populated areas. However, the pest has not proven
to be amenable to accepted pest management practices. Further complicating
the situation is the fact that GM is spread artificially through commercial
and other activities such as vacationers driving campers from one area of
the country to another, the movement of nursery (forestry) products, and
the movement of household goods which might contain egg masses of the pest.
The spread of GM through these types of activities is stochastic (random) in
nature and could result in GM being “imported” to any region of the country
at any time.


It is now recognized that GM cannot be eradicated from infested parts of
the Northeast, but hopefully its spread can be controlled. Several
alternative policies have been proposed to minimize the economic impact of
the spread of GM. The evaluation of these policy alternatives in terms of
total costs and benefits was the motivating factor for the development of
GYMMSIM1, the Gypsy Moth Movement Simulator, version #1.


GYMMSIM1 is a stochastic computer simulation for modeling the economic 
effects of the spread of GM. It is written in modular, structured form 
using Statistical Analysis System (SAS) as the programming language. This 
allows modification to subportions of the program with relative ease, 
especially for those portions which may require extensive modification as 
more knowledge of the gypsy moth's life cycle becomes available. Thus, 
GYMMSIM1 represents only the first version of a model which may be further 
altered in the future. 


This documentation presents the computer model as presently structured. 
It explains the rationale for the model specification and the simplifying 
assumptions that resulted from time and budget constraints as well as 
limited knowledge of the dynamics involved in the spread of the GM only 
when such information explicitly affects the computer modeling effort. 
Finally, this document assumes that the reader is familiar with SAS. 


BACKGROUND 


The GM was imported into the U.S. in 1869 for use in biological
experimentation. Unfortunately, the pest escaped from its cages in Medford,
Massachusetts and has been spreading throughout the northeastern part of 
the U.S. The overall northeastern area now occupied by the GM is known as 
the generally infested area (GIA). The boundary between the GIA and the 
remainder of the U.S. to the west and south is known as the leading edge 
(LE).


At one time an attempt was made to contain the GM to the New England area.
For many years the Hudson River area served as a natural barrier in this 
regard. However, a hurricane provided the necessary impetus to breach this 
barrier. The GM then spread throughout the Mid-Atlantic States and has at 
this time reached Virginia. 


As the GIA has expanded over the years, some areas have become severely 
infested by the GM, and large scale defoliation of trees has occurred. For 
the most part these regions have not been important for commercial forestry 
production. However, the pest has been a great nuisance to humans, due
both to the destruction of the aesthetic qualities of residential properties
and scenic public areas and also to the unpleasantness of large numbers of
GM, much as flies are unpleasant at a picnic. The pests are noisy, destroy
plant life, and are repugnant, especially when large numbers are present.
Consequently, communities experiencing these outbreaks have been demanding
action by public officials.


An additional factor encouraging the spread of the GM is human activity.
We are very mobile creatures and provide the GM many potential opportunities
to spread. Vacationers from all across the country move in and out of the
GIA. Nurseries within the GIA ship products out of the GIA. People move
from within the GIA to regions outside of the GIA. In each of these examples,
clusters of GM egg masses may be transported to remote areas within hours
or days, something which might not occur naturally for hundreds of years.
As the size of the GIA has increased, the opportunity for spreading the GM
by these “artificial” means has also increased. Thus, we should expect
the GM to spread more quickly and to many more areas, some far removed
from the present GIA.


Since the GM poses problems as a nuisance, hampering recreational activities
and causing losses to recreational and related industries, residential
and business entities will expend resources in an effort to control and
minimize its disruptive effects. Also, expenditures may be necessary to
remove and/or replace trees and plants killed by the GM. Moreover, uncertain
economic impacts loom in the future as the GM moves into areas important
for lumber and other forestry products. In the long run GM could possibly
be an important factor affecting the ecology of important lumber producing
regions of the country.


Although several governmental agencies are involved, the Animal and Plant 
Health Inspection Service (APHIS) is the primary government agency concerned 
with the gypsy moth problem and has established inspection programs to 
prevent the movement of infested articles of commerce out of the GIA as 
well as other programs to regulate, detect, and eradicate infestations 
which occur outside of the GIA. Thus, APHIS recognizes these potential 
problems, including the fact that the total area affected by the GM is 
increasing rapidly due both to the increasing spread by artificial means 
and to the geometric shape of the GIA. They are also aware that political 
pressure for control of the GM is likely to increase as more communities 
become affected. At the same time, funds for combating the problem are 
limited. As the GM spreads, any given funding level will be used to combat 
the GM over a larger area, diluting the resources available at any one
location. The problem is further aggravated by reductions in current
funding levels. Thus, if we are to stand any chance of controlling the
spread of GM, we must use the most efficient program possible. APHIS must
decide whether it is beneficial to conduct any program to combat the GM
and, if so, which program would be most effective.


In this regard, many alternative programs are possible. They include: 
(1) having no APHIS program, (2) having only the APHIS inspection program 
within the GIA, (3) continuing only the present APHIS eradication program 
for isolated infestations outside of the GIA, and (4) continuing both the 
inspection and eradication programs as presently implemented. 


The objective of the GYMMSIM1 computer model is to analyze these and other
possible program options under various scenarios to determine the resultant
total costs and benefits.


The following section provides a general description of the model including 
rationale for the various parameters required for any given program option 
and scenario. A detailed list of variables and definitions then follows. 
(A detailed flow chart of the model is provided by Appendix C). 


GENERAL MODEL DESCRIPTION 


The following narrative describes several general scenarios and how the 
computer models these scenarios. Parameters which must be specified are 
listed in parentheses (all capital letters) in that part of the narrative 
which first describes their function. Finally, a discussion of other 
parameters relating to the APHIS program and a discussion of modeling the 
movement of the LE of the GIA is presented. 


One computer run can give multiple simulations (NS). The present value of 
each decade within the planning horizon of each simulation is listed separately. 
Thus, a distribution may be derived if desired. 


Base Scenario 


If no action is taken by APHIS, the LE of the GIA will move south and west
year by year at a “natural” rate. This natural movement, which occurs
without any government program, forms the “base scenario” against which
other programs are measured. There is no measurement of losses behind the
LE since no action will be taken in this area under any circumstances.
The present value of losses occurring in front of the LE provides the
criterion against which other programs are evaluated. The present value, of
course, varies with the discount rate (R) used for the simulation.
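
The role of the discount rate can be illustrated with a short sketch. It is
written in Python rather than SAS purely for illustration and is not part of
GYMMSIM1. The function name and the end-of-year discounting convention are
assumptions; the text states only that the present value varies with R.

```python
def present_value(yearly_amounts, r):
    """Discount yearly amounts (year 1 first) back to year 0 at rate r.

    Assumes each amount falls at the end of its year; this convention is
    an illustrative assumption, not taken from the GYMMSIM1 source.
    """
    return sum(a / (1.0 + r) ** t
               for t, a in enumerate(yearly_amounts, start=1))


# Hypothetical losses of 100 per year for three years, discounted at 10%.
losses = [100.0, 100.0, 100.0]
print(round(present_value(losses, 0.10), 2))
```

A higher value of R shrinks the weight given to losses late in the planning
horizon, which is why the same scenario can rank differently under different
discount rates.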


Retarding Movement of the Leading Edge 


If action is taken to retard the movement of the LE, the GIA will grow at 
a slower rate than otherwise, and losses outside the initial position of 
the LE will be delayed or prevented, depending on the rate of movement of 
the LE. There is a certain program cost associated with the prevention
program each year as well as losses which do not occur each year because
of the program but would have occurred under the base scenario. These
costs and prevented losses can be tabulated over the planning horizon
(YRS) of the simulation, and their present value calculated. The total
cost of retarding the movement of the LE in the initial year of the
simulation (COSTNAT) is entered. This cost, over time, is a linear
function of the total length of the LE.


The losses prevented by a program designed to retard the movement of the
leading edge include those losses which do not occur in the area between
the position of the LE and the corresponding position of the LE in the
base scenario. In Figure 1, the initial area of the generally infested area
spreading southward and westward is represented by that portion of the
sector from the center to the radius R0. Under the base scenario, the GIA
would expand at a rate (VO) and the LE would advance to radius R1 (base)
at the end of the first year. However, a program designed to slow the spread
of the GIA limits the advancement of the LE to R1 (retarded), at a rate
(VOO) equal to or less than (VO). The area between these two radii,
represented by the hatched area in Figure 1, is the area that would have
been absorbed into the GIA under the base scenario, but whose absorption
has been prevented by a program to slow the movement of the LE. The area in
which these losses are prevented is assigned a constant loss factor
(contained in ADJF) for each of the timber, recreation, and residential
categories multiplied by the proportion of the total area receiving the
timber, recreation, and/or residential loss (ADJEFF (2, 1), ADJEFF (2, 2),
and ADJEFF (2, 3), respectively). Since the objective function is
measured in terms of the value of costs and losses, these prevented losses
enter the objective function as a negative number.
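
The geometry described above can be sketched as follows. This is a Python
illustration, not the SAS code of the model. It assumes the GIA is a single
circular sector whose angle is given in degrees (as WIDTH is), and that the
row 2 values of ADJF and ADJEFF are applied per square mile; the function
and argument names are hypothetical.

```python
import math


def sector_area(radius, width_deg):
    """Area of a circular sector of the given radius and angle (degrees)."""
    return math.pi * radius ** 2 * width_deg / 360.0


def prevented_loss(r_base, r_retarded, width_deg, adjf_row2, adjeff_row2):
    """Loss prevented in the band between the base and retarded LE.

    adjf_row2 and adjeff_row2 hold the (timber, recreation, residential)
    loss factors and affected proportions. The result is negative because
    prevented losses reduce the total of costs and losses.
    """
    band = sector_area(r_base, width_deg) - sector_area(r_retarded, width_deg)
    return -sum(band * f * e for f, e in zip(adjf_row2, adjeff_row2))


# Hypothetical values: LE at 110 miles (base) versus 105 miles (retarded).
print(prevented_loss(110.0, 105.0, 90.0, (50.0, 20.0, 10.0), (0.1, 0.2, 0.3)))
```

When the retarded rate equals the base rate the two radii coincide and the
prevented loss is zero, which matches the case of no retardation program.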


Untreated Isolated Infestations 


The artificial spread of the GM may also be modeled. This spread is 
clearly stochastic in nature. A camper, a shipment of Christmas trees, or 
a shipment of household goods containing a cluster of GM eggs are examples 
of activities which could lead to an isolated infestation being spread to 
any part of the country, inside or outside the GIA. The various categories 
of activities could be modeled separately, but, lacking essential parameter 
values for these activities, we assume that the probability of any city 
(SMSA or standard metropolitan statistical area) receiving an isolated 
infestation is directly related to the population of the SMSA (a proxy for 
the amount of commercial activity occurring) and inversely related to the 
distance of the SMSA from the LE (the further away from the LE, the less 
commercial activity associated with cities within the GIA). Given the
number of isolated infestations occurring each year of the planning horizon
(NI), a random number generator with a given seed (SEED) can then choose
the SMSA's which receive an isolated infestation(s) during the year. (The
SMSA is actually only a proxy for the general overall geographical area
surrounding the city. However, since most of the population within the
general area is near or within the SMSA, it should be a good proxy.)
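
The selection step can be sketched as below, assuming the cumulative
probabilities for the year are already available as an increasing list that
ends at 1.0. The sketch uses Python's standard generator, so it will not
reproduce the random stream of the SAS program; the function name is
hypothetical.

```python
import bisect
import random


def choose_smsas(cdf, n_infestations, seed):
    """Pick the index of the SMSA receiving each isolated infestation.

    cdf is the cumulative distribution over SMSAs for one year. A uniform
    draw u selects the first SMSA whose cumulative probability is >= u.
    """
    rng = random.Random(seed)          # seeded, so a run can be repeated
    last = len(cdf) - 1
    picks = []
    for _ in range(n_infestations):
        u = rng.random()
        # min() guards against a CDF that ends slightly below 1.0
        picks.append(min(bisect.bisect_left(cdf, u), last))
    return picks


# Three hypothetical SMSAs with probabilities 0.2, 0.3, and 0.5.
print(choose_smsas([0.2, 0.5, 1.0], 5, 1234567))
```

Because the generator is seeded, rerunning with the same seed gives the
same set of infested SMSAs, which makes it possible to compare program
options against an identical sequence of isolated infestations.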


Figure 1. MOVEMENT OF THE LEADING EDGE--BASE SCENARIO VERSUS A PROGRAM
TO RETARD THE MOVEMENT OF THE LEADING EDGE.


Assuming an initial size for an isolated infestation (AREAO) and a yearly
growth rate (GR) for the area covered by an isolated infestation, we may
calculate the area covered by the isolated infestation year by year until
the SMSA is absorbed into the GIA or until the end of the planning horizon.
(When the isolated infestation is absorbed into the GIA, it no longer
causes additional losses above those occurring in the base scenario.)
Over the planning horizon a city may receive many isolated infestations.
There is some probability that a new isolated infestation may occur in a
specific area within a SMSA which is already covered by a previous isolated
infestation. This is particularly true after a number of isolated
infestations within a given SMSA have grown for a number of years. The model
does not attempt to model this phenomenon stochastically, but assumes that
all isolated infestations up to a specified number (NINFMAX) are separate
and independent. All isolated infestations above this number are assumed
to occur in an area already infested and thus do not add to the total
infested area.


The model has taken a given isolated infestation, randomly assigned it to a
SMSA, and allowed the area covered by the isolated infestation to grow
over time. Associated with this area is a loss function to determine the
losses caused by the particular isolated infestation over the various
years it is in effect.


Each SMSA could have a different amount of loss from lumber, recreational,
and residential categories. The computer model provides this option by
allowing the programmer to specify individual loss values for each city by
category of loss (see the section on SAS Data Base). The computer program
allows assigned loss values to be “blown up” or scaled by a constant factor
(ADJF). (This feature is particularly useful if all cities have the same
loss for any given category. That category can be represented by a 0-1
dummy variable and multiplied by the scale factor to derive the actual
loss value.) Recognizing that only a portion of any isolated, infested
area will suffer losses, the program has an effective loss factor (ADJEFF
(1, 1), ADJEFF (1, 2), and ADJEFF (1, 3) for timber, recreational, and
residential losses) to calculate that proportion of the area to be assigned
an economic loss. In addition, the program recognizes that residential
density is not constant throughout a SMSA and the surrounding area. This
is accounted for by keeping a yearly running total of an SMSA's total
infested area. The sum of all infested areas up to a given stated proportion
(PART) of a SMSA's total area is assumed to receive the total residential
loss function. Any infested area above the stated proportion is assigned
only a fraction (FRACT) of the original residential loss function. Finally,
an isolated infestation does not immediately cause damages even though it
occupies an area. An incubation period of several years (SKIP) is required
for the pest to develop to the point of causing severe defoliation and
accompanying economic losses. After this period, the model assumes the
pest causes economic losses.
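
The treatment of residential density through PART and FRACT can be sketched
as follows. This Python fragment is only an illustration of the rule stated
above, not the SAS code of the model; the function name is hypothetical and
areas are assumed to be in square miles.

```python
def residential_loss_area(infested_area, city_area, part, fract):
    """Infested area weighted for the residential loss function.

    Area up to part * city_area counts in full. Any infested area beyond
    that threshold counts at only the fraction fract.
    """
    threshold = part * city_area
    full = min(infested_area, threshold)
    excess = max(infested_area - threshold, 0.0)
    return full + fract * excess


# Hypothetical SMSA of 100 square miles with PART = 0.2 and FRACT = 0.5:
# 30 infested square miles count as 20 in full plus half of the other 10.
print(residential_loss_area(30.0, 100.0, 0.2, 0.5))
```

The weighted area would then be multiplied by the residential loss value
and its scaling factors to obtain the dollar loss for the year.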


Eradicated Isolated Infestations 


When a city receives an isolated infestation that is not treated, the 
growth process described in the previous section occurs. However, the 
model provides for the eradication of the infestation with 100 percent 
effectiveness at a specified cost per infestation eradicated (ERADCOST). 
In this case the isolated infestations do not grow and cause losses. 


However, there are several situations in which an isolated infestation
would not be eradicated even though an eradication program is in effect.
These include cities in which the environment is deemed too severe for the
GM to survive and thrive (all categories of the SMSA's loss functions in
the SAS data base are zero) as well as those cities so close to the LE
(less than BORDER) that the city will be absorbed into the GIA within a
relatively short period of time.
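
The conditions under which an isolated infestation is eradicated can be
summarized in a small sketch. It is a Python illustration under the
assumptions stated in the text (a zero ERADCOST means no eradication
program; distances are in miles); the function and argument names are
hypothetical.

```python
def should_eradicate(eradcost, dist_from_le, border,
                     lumloss, recloss, resloss):
    """Return True if the eradication program treats this infestation."""
    if eradcost == 0:
        return False                   # no eradication program in effect
    if lumloss == 0 and recloss == 0 and resloss == 0:
        return False                   # environment too severe for the GM
    return dist_from_le >= border      # too close to the LE: not treated


# Hypothetical city 200 miles ahead of the LE with BORDER = 50 miles.
print(should_eradicate(5000.0, 200.0, 50.0, 1.0, 1.0, 30.0))
```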


APHIS Program Parameters 


Several parameters which relate to the APHIS program must be specified.
As previously described, the first year total cost (COSTNAT) of retarding
the movement of the LE must be entered. Costs in subsequent years are a
linear function of the length of the LE. If there is no program to retard
the LE, COSTNAT = 0. The initial total cost of the APHIS regulation and
detection program (FC) must be entered as well as the average cost per
square mile (VC) of this program. Total costs in subsequent years are
then calculated as a linear function of the total area within the GIA.
Finally, as previously described, the cost of eradicating an isolated
infestation (ERADCOST) must be entered.


The number of isolated infestations each year (NI) has been entered to
reflect expert opinion. This could later be modified to reflect some as
yet unspecified functional relationship. In addition, APHIS could undertake
a program to prevent some percentage (REGEFF) of potential isolated
infestations from actually occurring. The computer model allows for this by
multiplying the yearly number of new isolated infestations (NI) by the
percentage of potential isolated infestations actually occurring (1-REGEFF)
and rounding off the answer.
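
As a worked illustration of this adjustment, the Python sketch below applies
(1-REGEFF) to NI and rounds the result. The round-half-up rule is an
assumption, since the text says only that the answer is rounded off, and
the function name is hypothetical.

```python
import math


def infestations_after_regulation(ni, regeff):
    """Yearly isolated infestations remaining after regulation.

    ni is the unregulated yearly count and regeff the fraction prevented.
    Halves are rounded up (an assumed rounding rule).
    """
    return int(math.floor(ni * (1.0 - regeff) + 0.5))


# Ten potential infestations with 25 percent prevented: 7.5 rounds to 8.
print(infestations_after_regulation(10, 0.25))
```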


Modeling the Leading Edge 


Although the GIA is actually irregularly shaped, that portion of the GIA 
facing and expanding south and west is basically shaped as a sector of a 
circle (see Figure 1). Thus, for modeling purposes, the LE is the perimeter 
“F" of a sector of a circle whose radius increases each year as the natural 
spread of the GM expands the GIA. Several model parameters are dual valued 
so that, if desired, future versions of the model may have two different 
sectors of two different concentric circles to represent the GIA. In this 
manner different parts of the GIA would be allowed to expand at different 
rates. 


Data Requirements 


The model requires a data base containing information about the SMSA's 
outside of the GIA. It also requires a matrix giving the probabilities of 
any given city receiving an isolated infestation in any given year. Finally, 
the model requires that the various parameters, as described previously, be
specified. These parameters are contained at the beginning of the program
in a macro named PARAMETE. Several of these parameters are multivalued;
thus, matrix dimensions are included in the following definitions.


VARIABLES


The following variables are contained in macro PARAMETE. This macro occurs 
within the SAS PROC MATRIX procedure, and all variables must be input in 
proper matrix form. Dimensions are noted in subscripts in the following 
descriptions: 


R_1x1 = the discount rate (yearly rate of interest), expressed as a decimal.


NS_1x1 = the number of simulations to make in the computer run. (Each
simulation starts at the initial position of the leading edge
and progresses to the end of the planning horizon.)


YRS_1x1 = the number of years (planning horizon) contained in each simulation.


SEED_1x1 = 7 digit odd integer to control the start of the random number
generator.


GR_2x1 = the yearly growth rate of the area of an isolated infestation
expressed as a decimal. Two different growth rates are allowed.


AREAO_1x1 = the assumed initial area (in acres) of a new isolated infestation.


NINFMAX_1x1 = the number of independent isolated infestations a given city may
receive. Any isolated infestation above this number is implicitly
assumed to be within an area of the SMSA previously infested.
Thus, this new infestation does not increase the total area of
isolated infestations within the SMSA.


PART_1x1 = that proportion of the area of a SMSA which receives the full value
of the residential loss function. When the total area of the various
isolated infestations within a given SMSA exceeds this proportion
of the SMSA's total area, this “excess” portion of the infested
area receives only a fraction of the original residential loss
function.


FRACT_1x1 = that fraction of the residential loss function assigned to that
part of the infested area above a given proportion of the SMSA's
total area (see PART, above).


COSTNAT_1x1 = the APHIS cost of limiting the yearly movement of the leading
edge at the beginning of the planning horizon. (See section on
the appropriate COST macro for specific data to include in COSTNAT.)


FC_1x1 = the initial cost of the APHIS inspection program at the beginning
of the planning horizon. (See section on the appropriate COST macro
for specific data to include in FC.)


VC_1x1 = additional APHIS costs which change over time. (See section on
the appropriate COST macro for specific data to include in VC.)


ERADCOST_1x1 = the cost of eradicating each isolated infestation.


BORDER_1x1 = the distance beyond the LE of the GIA within which no
eradication program will take place even though an isolated
infestation occurs.


WIDTH_2x1 = width (in degrees) of the sector within each of the two
concentric circles used in the model to represent the expanding
portion of the GIA.


SKIP_1x1 = the number of years from the beginning of an isolated infestation
through the last year within which no monetary loss is perceived.
In other words, the “build up” period of the GM infestation
before it causes damages.


ADJF_2x3 = multiplicative adjustment factor for (1) lumber, (2) recreational,
and (3) residential losses. The data base required by the program
contains a loss factor for each of these categories for each SMSA.
Row 1 of this variable allows these loss factors to be scaled for
calculating losses from an isolated infestation. Row 2 allows
different scaling factors to be used in calculating “losses
prevented” by a policy which retards movement of the leading
edge (see Figure 1).


ADJEFF_2x3 = adjustment factor giving that proportion of an infested area
actually receiving economic losses for (1) lumber, (2) recreational,
and (3) residential losses. Row 1 gives the proportions for an
isolated infestation within an SMSA while row 2 gives the
proportions for the total area which has been prevented from being
absorbed into the GIA by a program to retard the movement of the
leading edge.


NI_1xYRS = the number of isolated infestations assumed to occur each year of
the planning horizon.


VO_1x1 = rate of movement of LE in base run (miles per year).
VOO_1x1 = rate of movement of LE in this scenario (miles per year).


Macro PARAMETE also contains variables ISOGROW, KOST, and YRDATA. These
variables allow the selection of different ISOGROW, KOST, and YRDATA macros
(through the use of IF statements) depending on modeling considerations.
At this stage of model development, there are no alternative macros for
ISOGROW, KOST, and YRDATA; thus, these variables should all be assigned a
value of 1.


SAS DATA BASE 


The final item which may be varied for policy analysis is the SAS data
base being used for the analysis. This data base has been processed by a
previous SAS program which takes the original data for each SMSA outside
of the GIA at the start of the simulation, processes it, and outputs it to
a SAS data base in a form needed by GYMMSIM1. The data base is composed
of three SAS data sets, each of which is needed by GYMMSIM1. Each of the
data sets has a first level name which is the same as the DD-name of the
DD statement giving the operating system (OS) data set name which contains
the SAS data base. The following sections, headed by the SAS data set
second level name, describe the information contained in each SAS data
set.


INFO. This data set contains information about each SMSA. It has been
sorted by state and SMSA (city), and contains the following variables:


CITY - an alphanumeric variable containing the name of the SMSA.


STATE - an alphanumeric variable containing the post office abbreviation
of the name of the state.


POP78 - SMSA populations (in 1978).


DIST - the distance, in miles, from the SMSA to University Park, Pa.,
the center of the concentric circles used in the model to
represent that portion of the GIA which moves south and west.


DUMMY - a dummy variable with one of the following values:


0 - the environment of the region surrounding the SMSA is such that
the isolated infestation is not likely to survive and/or grow.


1 - the isolated infestation will grow at a slow rate (value
contained in GR(1, 1)).


2 - the isolated infestation will grow at a fast rate (value
contained in GR(2, 1)).


LUMLOSS - the average dollar loss (per square mile) of timber caused
by a GM infestation in the general region of this SMSA.
(If the loss is an amount which remains constant (or zero)
over all SMSA's, this value can be 1 and then multiplied by
ADJF.)


RECLOSS - the average dollar loss (per square mile) of recreational
output in the general region of this SMSA. (Also 1, if
uniform losses are assumed.)


RESLOSS - the average dollar loss or cost (per household) of residents
in the SMSA. (Also may be 1. However, this would be a poor
assumption for this variable due to the different populations
and demographic characteristics among the SMSA's.)


CITYAREA - the area of each SMSA, expressed in square miles.


VR - determines which concentric sector the SMSA is in, the sector
within which the LE is progressing at a slow or fast rate,
as given in V_2x1, the rate of movement of the LE.


PROB. This SAS data set contains a matrix which gives information on the
probability of any city (SMSA) receiving a given isolated infestation. It
is in the form of a cumulative distribution function (CDF) for each year
since these probabilities, as presently specified, are in part a function
of the distance of the SMSA from the LE and thus vary over time. Therefore,
this data set is in the form of a matrix in which the rows are the cumulative
probability of any city receiving a random infestation in a given year while
the columns represent these probabilities for the various years.


PARA. The probability of any city receiving an isolated infestation is a
function of:


EDGEO_2x1 = the distance (miles) of the LE from the center of the
concentric circles (University Park) at the start of the
simulation.


V_2x1 = the rate of movement (miles per year) of the LE as the GIA expands.


KR = the threshold distance from the LE beyond which the probability 
of any SMSA receiving an isolated infestation decreases with 
distance. (At any distance less than KR, the probability of a 
SMSA receiving an isolated infestation does not vary with distance 
and is the same as it would be at KR.) 


YRS1 = the number of years for which the CDF's have been calculated. 
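
One way the yearly CDF could be built from these quantities is sketched
below in Python. The exact functional form is not given in this document;
the sketch assumes the weight of each SMSA is its population divided by its
distance from the LE, with the distance held at KR for any SMSA closer than
KR. The function names are hypothetical.

```python
def infestation_weights(populations, dists_from_le, kr):
    """Relative chance of each SMSA receiving an isolated infestation.

    Assumed form: population / distance, with distances below the
    threshold kr treated as equal to kr (kr must be positive).
    """
    return [p / max(d, kr) for p, d in zip(populations, dists_from_le)]


def to_cdf(weights):
    """Convert nonnegative weights into cumulative probabilities."""
    total = sum(weights)
    cdf, running = [], 0.0
    for w in weights:
        running += w
        cdf.append(running / total)
    return cdf


# Two hypothetical SMSAs of equal population, 5 and 20 miles from the LE,
# with KR = 10: the nearer city is treated as if it were 10 miles away.
weights = infestation_weights([100.0, 100.0], [5.0, 20.0], 10.0)
print(to_cdf(weights))
```

Because the distance of each SMSA from the LE shrinks as the LE advances,
the weights would be recomputed for each year, giving one column of the
PROB matrix per year.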


Since the probabilities in the cumulative distribution functions depend on
the values of the variables EDGEO, V, KR, and YRS1, these variables are
stored in a SAS data set and have been concatenated into a 6x1 array named
PARA. This prevents a programmer from inadvertently changing the value(s)
of any of these variable(s) within the program and thus running the program
with an incorrect CDF. If the rate of movement of the leading edge selected
for the simulation (VOO) is not equal to V, the rate of movement of the
leading edge used to generate the CDF, a warning message is written, and
the simulation portion of the program terminates.


MACROS


Several portions of GYMMSIM1 are contained in SAS macros. This makes it
possible to have several versions of these portions of the program and 
choose among the alternatives through the use of IF statements associated 
with the variables ISOGROW, KOST, and YRDATA as explained in the section 
on variables. 


PARAMETE. The beginning of this macro contains NOTE statements which
start the output section of the computer printout. These notes describe
the scenario being run and should be updated to reflect the new scenario
when any parameter is changed. The remainder of PARAMETE contains parameters
which may change from simulation to simulation. These parameters have been
defined in the section on variables. Unlike a compiled Fortran program,
which uses a separate data input section, all parameters in this SAS
simulation program are entered in macro PARAMETE (the first section of the
SAS program) using SAS assignment statements in proper matrix notation.




KOST1. This macro calculates the yearly costs of all APHIS programs
(excluding eradication) for the planning horizon. These costs are contained
in variables FC, COSTNAT, and VC and, as modeled in KOST1, represent
costs which are fixed, are a linear function of the length of the LE, and
are a linear function of the area absorbed into the GIA after the start of
the simulation, i.e., the total area of the GIA at year J minus the initial
area of the GIA at year zero. For example, these costs, as modeled in
KOST1, consist of two possible components: the inspection program and the
program to slow the expansion of the LE. The cost of the inspection program
is a function of the area within the GIA. The yearly cost of inspecting
the area (at time = 0) within the GIA is included in variable FC while the
cost of inspecting an additional square mile is contained in VC. The cost
of slowing the natural spread of GM is a linear function of the length of
the LE. The initial cost of this retardation program for the initial
length of the LE is contained in COSTNAT.
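
The cost structure described for KOST1 can be sketched as follows. This is
a Python illustration, not the macro itself. It assumes a single sector
whose angle is given in degrees and that the retardation cost scales in
proportion to the length of the LE (the text says only that it is a linear
function of that length); all names are hypothetical.

```python
import math


def sector_area(radius, width_deg):
    """Area of a circular sector of the given radius and angle (degrees)."""
    return math.pi * radius ** 2 * width_deg / 360.0


def yearly_program_cost(fc, vc, costnat, r0, r_j, width_deg):
    """APHIS program cost (excluding eradication) in year J.

    fc      cost of inspecting the initial GIA
    vc      cost per additional square mile absorbed into the GIA
    costnat retardation cost for the initial length of the LE
    r0, r_j radius of the LE at year zero and at year J (miles)
    """
    added_area = sector_area(r_j, width_deg) - sector_area(r0, width_deg)
    le_ratio = r_j / r0     # arc length is proportional to the radius
    return fc + vc * added_area + costnat * le_ratio


# Year zero: no added area and the LE at its initial length.
print(yearly_program_cost(1000.0, 2.0, 500.0, 100.0, 100.0, 90.0))
```

With no retardation program, COSTNAT is zero and only the inspection terms
remain.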


ISOGROW1. This macro takes the assumed size of an initial infestation (in
acres), converts it to square miles, and calculates the size of an isolated
infestation starting at year 1 and continuing throughout the planning
horizon. This is done for each of the two growth rates given in GR. The
resulting areas are stored in AREA_2xYRS.
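
The calculation performed by ISOGROW1 can be sketched in Python as below.
Compound growth, with the infestation at its initial size in year 1, is an
assumption; this document states only that a yearly growth rate is applied.
The function name is hypothetical.

```python
ACRES_PER_SQUARE_MILE = 640.0


def isolated_infestation_areas(area0_acres, growth_rates, years):
    """Area (square miles) of an isolated infestation for each year.

    Returns one row per growth rate and one column per year, mirroring
    the 2 x YRS layout of AREA when two growth rates are supplied.
    """
    area0 = area0_acres / ACRES_PER_SQUARE_MILE
    return [[area0 * (1.0 + gr) ** (t - 1) for t in range(1, years + 1)]
            for gr in growth_rates]


# A hypothetical 640 acre infestation under slow and fast growth rates.
print(isolated_infestation_areas(640.0, [0.5, 1.0], 3))
```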


YRDATA1. This macro forms the heart of the simulation and is the most 
complex portion of the program. If ERADCOST, the cost of eradicating an 
isolated infestation, is any amount other than zero, the macro checks 
whether the infestation is at least the required minimum distance from the 
LE and is in an SMSA which will suffer economic loss. If so, it increments 
the eradication costs and the APHIS eradication program counter for that year. 


If ERADCOST is zero, the macro checks to see that the city in question has 
not exceeded its maximum number of independent, isolated infestations. If 
not, it calculates the number of years until the LE of the GIA will reach 
the city. Then, if the SMSA will suffer economic losses, the macro uses 
the appropriate area of an isolated infestation over the time frame, and 
calculates the losses from timber, recreational, and residential sources, 
after the end of the “incubation” period of the infestation and throughout 
the years the isolated infestation is a factor in causing economic losses. 
These losses are accumulated for all cities for each year in the planning 
horizon. 
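
The branching described for YRDATA1 can be sketched as follows. This is a 
hedged Python illustration, not the SAS macro. All names are hypothetical. 
Two assumptions are made: an infestation that does not qualify for 
eradication falls through to the growth branch, and the number of years 
until the LE reaches the city is the distance divided by the speed of the 
LE, rounded up. 

```python
import math

def classify_infestation(eradcost, dist_to_le, min_dist, loss_factor,
                         n_in_city, max_in_city, le_speed):
    """Return (action, years_until_le) for one new isolated infestation."""
    will_lose = loss_factor > 0
    # Eradication branch: program in effect, losses expected, far enough
    # from the leading edge.
    if eradcost != 0 and will_lose and dist_to_le >= min_dist:
        return ("eradicate", 0)
    # Assumed fall-through: city already holds more than its maximum
    # number of independent infestations, so this one is not tracked.
    if n_in_city > max_in_city:
        return ("ignore", 0)
    # Assumption: LE advances at a constant speed (miles per year).
    years_until_le = math.ceil(dist_to_le / le_speed)
    if not will_lose:
        return ("no_loss", years_until_le)
    return ("grow", years_until_le)
```

In the "grow" case the caller would then accumulate timber, recreational, 
and residential losses for the years after the incubation period. 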


SUMMARY 


This documentation has presented a brief background of the spread of the GM 
and a general model description that includes examples of scenarios which 
may be simulated by the model. These scenarios show how the parameters 
are used in the model. A formal definition of the variables, a description 
of the macros used in the program, and a description of the SAS data base 
required by the model then follow. 




These sections provide the essential information needed to use the computer 
program to simulate different policy alternatives. Additional information 
needed to run the program at the Washington Computer Center is contained in 
Appendix A, which lists the job control cards needed for the IBM system. 


Appendix B contains a description of the program output along with a sample 
output. 


Appendices C and D provide more technical information on the program, 
oriented toward programmers and model builders who may wish to modify the 
model. Appendix C provides a description of the program with accompanying 
flow charts. Finally, Appendix D provides a source program listing 
corresponding to the sample output in Appendix B. The listing includes 
many comments which will also be helpful to those interested in the more 
technical aspects of the model. 
JCL CONSIDERATIONS 


GYMMSIM1 is programmed in SAS, version 79.5 and has been run on the IBM 
computer system at the U.S.D.A. Washington Computer Center (WCC). The 
complete program needed to run the model at WCC is shown below: 


A. Control cards: 
   1. Job Card 
   2. //SAS EXEC SAS,PARM='S=0',REGION=340K 
   3. //APHIS2 DD DISP=SHR,DSN=ERS23.ER6HF.SASDS.APHIS2.DATA 
   4. //*APHIS2 EDGEO=172,V=10,KR=100,YRS=30 
   5. //APHIS3 DD DISP=SHR,DSN=ERS23.ER6HF.SASDS.APHIS3.DATA 
   6. //*APHIS3 EDGEO=172,V=2,KR=100,YRS=30 
   7. //SAS.SYSIN DD * 
B. SAS Source Program (contained in Appendix D). 


The "//SAS" card in the control section tells the computer to use SAS, 
version 79.5. Only the "//APHIS2" or the "//APHIS3" card is needed for any 
given simulation run. APHIS2 and APHIS3 are the DD names, which correspond 
to the first level names of the three SAS data sets used in the program. 
The "//*" cards are comment cards which give the DD name and parameter 
values applicable to each data set. For example, the OS data set named 
ERS23.ER6HF.SASDS.APHIS2.DATA was derived from the parameters EDGEO=172 
miles, V=10 miles/year, KR=100 miles and YRS=30 years. (Although these two 
SAS data bases are the only ones available at the present time, others may 
be added at a later date.) The "//SAS.SYSIN DD *" card tells the computer 
that the SAS source program immediately follows this card. 


The CPU time needed to execute a program varies considerably, depending on 
parameters. For NS = 1, a planning horizon of 30 years, and a total number 
of new isolated infestations not more than 2000, execution time has varied 
between 5 and 50 seconds. Scenarios which do not use an eradication program 
take the most time for execution because the infestations grow over the 
years and the loss functions must be calculated year by year for each 
infestation. 
APPENDIX B 


OUTPUT DESCRIPTION AND SAMPLE OUTPUT 


GYMMSIM1 OUTPUT 


The initial section of a simulation output provides a note describing 
the scenario being simulated. This is followed by a listing of all parameter 
values (in SAS matrix notation) accompanied by a note describing the values 
being presented. Some summary statistics from program calculations are 
also presented in this section. 

The second section provides summary statistics for the computer run 
and includes on a yearly basis: the total number of new isolated infestations, 
the grand total area of isolated infestations beyond the LE, the grand 
total area added to the GIA since the start of the simulation, the value of 
timber, recreational, and residential losses for that year, APHIS inspection 
costs, eradication costs and total costs for the year, and the sum total of 
losses due to the pest plus APHIS costs for that year. These statistics 
are average values per simulation. 

The third section provides information similar to the second section. 
However, all dollar figures from the beginning of the simulation through 
the applicable year have been converted to a present value basis. 

The final output section lists, alphabetically by state, all SMSA's in the 
data set. It also lists the average number of isolated infestations per 
simulation from the start of each simulation through the end of each 
decade, and some of the more pertinent input data about each SMSA, 
including population in 1978, distance of the SMSA from Harrisburg, 
Pennsylvania, loss factors for timber, recreation, and residential areas, 
and area of each SMSA (in square miles). 
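
The present value figures of the third output section can be illustrated 
with a short Python sketch. This is not the SAS source. The function name 
and the argument rate (an annual discount rate) are hypothetical, and 
end-of-year discounting is assumed. 

```python
def present_value_through_year(yearly_amounts, rate):
    """Cumulative present value of a yearly dollar stream.

    Element t-1 of the result is the present value of all amounts from
    year 1 through year t, each discounted back to year zero.
    """
    total = 0.0
    cumulative = []
    for j, amount in enumerate(yearly_amounts, start=1):
        total += amount / (1.0 + rate) ** j
        cumulative.append(total)
    return cumulative
```

With a zero discount rate the result is simply the running total of the 
yearly figures, which is a convenient check on the calculation. 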
PROGRAM DESCRIPTION AND FLOW CHARTS 


Main Program 


The main program reads the appropriate SAS data set to input data about 
the SMSA's, then switches to the PROC MATRIX procedure. After calling 
macros PARAMETE, ISOGROW and KOST, the main program does preliminary 
conversion of data values and writes system parameters and relevant 
summary statistics to the output file. It then creates the matrices needed 
by the program before starting the actual simulation. At the start of each 
simulation, the appropriate matrices are zeroed out. Then, for each year 
of the planning horizon, the location of the LE is calculated. For each 
isolated infestation which occurs in that year, a city is randomly 
selected, macro YRDATA is called, and the number of times the city has 
been infested is incremented. When all isolated infestations for the year 
have occurred, the total number of infestations eradicated and the 
associated costs are tallied. APHIS inspection and eradication costs and 
losses are calculated. Losses and total APHIS costs are then converted to 
a future value basis, saving those future values which correspond to the 
end of each decade. 
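
The random selection of a city can be sketched as a draw from a cumulative 
distribution function (CDF) table, one entry per SMSA. The Python below is 
a minimal illustration under that assumption. The function name and the 
layout of the table (a nondecreasing list ending at 1.0) are hypothetical 
and are not taken from the SAS program. 

```python
import bisect
import random

def choose_smsa(cdf, u):
    """Return the index of the SMSA selected by a uniform draw u in [0, 1).

    cdf[i] is the cumulative probability that the infestation lands in
    SMSA 0..i. SMSA i is chosen when cdf[i-1] <= u < cdf[i].
    """
    index = bisect.bisect_right(cdf, u)
    # Guard against rounding in the last table entry.
    return min(index, len(cdf) - 1)

# Usage: draw one city for a new isolated infestation.
example_cdf = [0.2, 0.5, 1.0]
city = choose_smsa(example_cdf, random.random())
```

Passing the uniform draw as an argument keeps the selection rule separate 
from the random number generator, which makes the rule easy to check. 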
[Flow chart: Main Program. Box labels, in order of appearance:] 

   Input SAS data set SIM 
   Macro PARAMETE 
   Preliminary calculations of parameter values 
   Write notes and matrix values 
   Convert SAS data set to matrix form 
   Calculate area of SMSA's 
   Transform values of LOSSES (if necessary) 
   Concatenate reduced residential loss values to LOSSES 
   Input from SAS data set: (1) CDF matrix, (2) parameters specific 
      to this CDF matrix 
   Write values of parameters specific to CDF matrix 
   Desired parameter specified 
   Write warning note 
   YRS =_ YRS 
These values are then discounted to determine the present values of 
losses and costs for the various planning horizons (by decade). If more 
than one simulation occurs in the computer run, appropriate statistics are 
then converted to an average per simulation basis. Results are then placed 
in a format suitable for output, and dollar values are converted to a 
thousand dollar basis. Output matrices are written to a SAS data set. 
Additional notes and statistics are written to the output file. Finally, 
data sets are written to the output file which summarize statistics by 
year and give the number of isolated infestations for each city by decade 
along with data on each SMSA. 
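
The two-step treatment of dollar values (compounding to the end of a 
decade, then discounting to a present value) can be sketched as follows. 
This Python fragment is an illustration only. The function name, the 
argument rate, and the use of a single annual rate for both compounding 
and discounting are assumptions, not details taken from the SAS program. 

```python
def decade_present_values(yearly_amounts, rate, step=10):
    """Present value of the stream for each horizon of `step` years.

    For each horizon T (10, 20, ... years by default) the yearly amounts
    are first compounded to a future value at the end of year T, and that
    future value is then discounted back T years.
    """
    results = {}
    for horizon in range(step, len(yearly_amounts) + 1, step):
        future_value = sum(
            amount * (1.0 + rate) ** (horizon - j)
            for j, amount in enumerate(yearly_amounts[:horizon], start=1)
        )
        results[horizon] = future_value / (1.0 + rate) ** horizon
    return results
```

For a 30-year planning horizon the sketch returns one present value for 
each of the 10-, 20-, and 30-year horizons. 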


[Flow chart: Main Program, continued. Box labels, in order of appearance:] 

   Assign values to PV matrix 
   Choose SMSA from cumulative function table 
   Increment number of times city has been infested (by decade) 
   Zero out matrices needed for computer run 
   Appropriate ISOGROW macro 
   Appropriate YRDATA macro 
   Calculate total number isolated infestations (TNI) 
   Have all infestations for this year occurred? 
   Appropriate KOST macro 
   Write notes and parameter values 
   Increment (1) total # eradications, (2) total cost of eradications 
   More than 1 simulation this run? (Yes) 
   Calculate av. values/simulation 
   Any years left in planning horizon? (No) 
   (1) Calculate total APHIS costs and grand total all costs and losses 
   Increment total infestations each city for all simulations 
   Convert dollar figures to thousands of dollars 
   Calculate present value costs and losses through each year 
   Determine all costs & losses for this simulation 
   (1) Determine future values of losses & costs this simulation 
   Output matrices to SAS data sets 
   Save those values corresponding to the end of the decade(s) 
   Write notes and matrix values 
   All simulation 
   Discount future value of losses and costs to present value 
   (1) Write Yearly Summary of Simulation 
       (A) Losses and costs by year 
       (B) PV losses and costs through years 
   (2) Write Av. # simulations per city and SMSA input data 
Macro PARAMETE 


This macro contains a note describing the scenario for the computer 
output. It also contains input values for all parameters in the 
simulation. 


Macro ISOGROW1 


This macro calculates the size of an isolated infestation at any given 
time given the infestation's initial size, two possible rates of growth, 
and maximum period for growth (planning horizon). 

[Flow chart: Macro ISOGROW1. Box labels, in order of appearance:] 

   Enter ISOGROW1 
   Compute area of isolated infestation for year J (in sq. miles) 
      for both growth rates 


Macro KOST1 


This macro calculates the increase in size of the GIA over the planning 
horizon and all APHIS costs excluding eradication costs. 

[Flow chart: Macro KOST1. Box labels, in order of appearance:] 

   Calculate LENCOST (cost/unit length of the LE) 
   Calculate length of the LE this year 
   Calculate increase in area of GIA from initial size to present 
      size this year 
   Calculate APHIS costs for year = f (Length LE) (excluding 
      eradication costs) 
Macro YRDATA1 


This macro takes the city receiving an isolated infestation and 

(1) eradicates the infestation if (a) an eradication program is in 
    effect, (b) economic losses would occur in the area if eradication 
    is not undertaken, and (c) the city is not within the required 
    distance of the leading edge, or 

(2) determines whether or not the infestation is "independent" of other 
    isolated infestations in the SMSA (the number is less than a 
    predetermined limit) and if so, (a) determines the number of years 
    the infestation will grow, (b) calculates total city area infested 
    as well as grand total area infested for all SMSA's, and (c) for 
    years in which the area is large enough to cause economic losses, 
    calculates the economic loss and summarizes it in a table by year. 

[Flow chart: Macro YRDATA1. Box labels, in order of appearance:] 

   Enter YRDATA1 
   Loss>0 and SMSA min distance from LE 
   # infestations in city greater than maximum allowed? 
   Increment counter for infestation eradicated 
   Calc. # years this isolated infestation will cause losses 
   Calculate total infested area in SMSA & grand total infested area 
   "Incubation" period finished? 
   Calculate dollar loss for this city and year, and tabulate in 
      proper loss category 
   Total years of losses over? 